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Abstract 

Background: Descendants from the extinct aurochs {Bos primigenius), taurine {Bos tourus) and zebu cattle 
{Bos indicus) were domesticated 10,000 years ago in Southwestern and Southern Asia, respectively, and colonized 
the world undergoing complex events of admixture and selection. Molecular data, in particular genome-wide single 
nucleotide polymorphism (SNP) markers, can complement historic and archaeological records to elucidate these 
past events. However, SNP ascertainment in cattle has been optimized for taurine breeds, imposing limitations to 
the study of diversity in zebu cattle. As amplified fragment length polymorphism (AFLP) markers are discovered and 
genotyped as the samples are assayed, this type of marker is free of ascertainment bias. In order to obtain unbiased 
assessments of genetic differentiation and structure in taurine and zebu cattle, we analyzed a dataset of 135 AFLP 
markers in 1,593 samples from 13 zebu and 58 taurine breeds, representing nine continental areas. 

Results: We found a geographical pattern of expected heterozygosity in European taurine breeds decreasing 
with the distance from the domestication centre, arguing against a large-scale introgression from European or 
African aurochs. Zebu cattle were found to be at least as diverse as taurine cattle. Western African zebu cattle were 
found to have diverged more from Indian zebu than South American zebu. Model-based clustering and ancestry 
informative markers analyses suggested that this is due to taurine introgression. Although a large part of South 
American zebu cattle also descend from taurine cows, we did not detect significant levels of taurine ancestry in 
these breeds, probably because of systematic backcrossing with zebu bulls. Furthermore, limited zebu introgression 
was found in Podolian taurine breeds in Italy. 

Conclusions: The assessment of cattle diversity reported here contributes an unbiased global view to genetic 
differentiation and structure of taurine and zebu cattle populations, which is essential for an effective conservation 
of the bovine genetic resources. 
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Background 

As a consequence of over 10,000 years of domestication, 
migrations and natural as well as artificial selection, a 
wide range of phenotypically distinct cattle populations 
spread around the world. Several research initiatives have 
combined molecular marker datasets with historic and 
archaeological records in order to investigate the origin, 
history, genetic diversity, and differentiation of cattle pop- 
ulations (see Groeneveld et aL, 2010 [1] for a review on 
the topic). The collected evidences suggest that domestic 
cattle descend from the extinct aurochs {Bos primigenius) 
and are divided into two distinct but interfertile species: 
the humpless taurine cattle {Bos taurus) and the humped 
indicine or zebu cattle {Bos indicus). It is accepted that 
taurine and zebu cattle have arisen from separate centres 
of domestication about 8,000 years BC in the Fertile Cres- 
cent (modern-day countries of Israel, Jordan, Lebanon, 
Cyprus and Syria, and parts from Egypt, Turkey, Iraq, Iran 
and Kuwait) and the Indus valley (current Pakistan), re- 
spectively [2,3]. From these regions, cattle have spread 
throughout Europe, Asia and Africa due to the expansion 
of agriculture [4,5]. Taurine cattle were imported to the 
American continent after 1492, mainly from Iberian im- 
portations; in the early 20* century, Indian zebu cattle 
were introduced in Central and South America because of 
their adaptability to the tropical environment. 

Molecular markers have been essential to the investiga- 
tion of the history and genetic differentiation of domestic 
cattle. Recent studies applying genome-wide single nu- 
cleotide polymorphism (SNP) markers to investigate gen- 
etic structure and differentiation in multiple cattle breeds 
(e.g., [6-9]) resolved hypotheses that were not possible to 
be tested by using sparse panels of molecular markers. 
However, markers included in the most widely used SNP 
panel, the Illumina® BovineSNPSO BeadChip assay (50 k), 
were discovered in reduced representation libraries from 
pooled DNA samples of six taurine breeds [10], which 
leads to biased estimates of genetic structure and differ- 
entiation in zebu cattle [7]. 

As an alternative, amplified fragment length poly- 
morphism (AFLP) markers [11] have been used for al- 
most two decades. Due to their random nature and high 
reproducibility, they have been enabling ascertainment 
bias-free analysis of diversity in any species since before 
the advent of high throughput genotyping and sequen- 
cing technologies [12]. AFLP markers are produced by 
digesting genomic DNA with both a rare cutter and a 
frequent cutter restriction enzyme, with subsequent 
ligation of synthetic adapters to the restriction fragments 
to serve as primer-binding sites, and selective amplifica- 
tion of subsets of the restriction fragments with primers 
carrying additional nucleotides at their 3' end [13]. 

Although AFLP markers are highly informative [13-15] 
and unbiased, there are few examples of the application of 



this type of marker in multiple breed, large-scale popula- 
tion differentiation analysis in cattle. Negrini et al [16] 
used 81 AFLP and 19 microsatellite markers to estimate 
genetic distances among 51 breeds of cattle, including 
taurine and zebu cattle, and found that the AFLP panel 
could differentiate between zebu and taurine cattle better 
than the panel of microsatellites. Two studies in pigs 
[17,18] showed the potential of AFLP to survey genetic 
diversity at the continental scale. Because AFLP polymor- 
phisms are mainly (but not exclusively) based on point 
mutations, these markers are expected to indicate evolu- 
tionary divergence better than microsatellites with vari- 
able mutation rates. For instance, a microsatellite-based 
bovine phylogeny [19] was not in agreement with a phyl- 
ogeny based on sequence data [20], which was not the 
case for an AFLP-based phylogeny [21]. Thus, AFLP ap- 
pears to be a valuable complementary tool for studies of 
genetic diversity in cattle populations around the world. 

Aiming at an unbiased view of genetic structure and 
differentiation between taurine and zebu cattle breeds 
from distinct continental areas, we compiled a worldwide 
multi-breed AFLP dataset. We do not intend to suggest 
the use of sparse panels of molecular markers over the 
present portfolio of high-density SNP arrays, or to inter- 
rogate their legitimacy for diversity research in cattle. In- 
stead, we intend to propose an unbiased model of cattle 
differentiation which complements the assessment of 
genetic distance estimates obtained from molecular 
markers that are likely to suffer from ascertainment bias. 

Methods 

Sampling and molecular data 

A total of 1,593 individuals were genotyped for 135 AFLP 
markers, representing 13 zebu and 58 taurine breeds. The 
presence (genotype 1') or absence (genotype '0') of a band 
was scored considering AFLP as dominant markers, and 
occasional faint bands were considered as missing data. 
These samples were obtained from 23 countries from 9 dis- 
tinct continental areas: Southern Asia (3 zebu breeds). 
Southwestern Asia (2 taurine breeds). Eastern Europe (3 
taurine breeds). Central Europe (24 taurine breeds). North- 
ern Europe (10 taurine breeds). Southern Europe (10 tau- 
rine breeds). Western Europe (8 taurine breeds). Western 
Africa (7 zebu breeds and 1 taurine breed), and South 
America (3 zebu breeds). This dataset builds on the data 
reported by Negrini et al [16] by inclusion of samples of 20 
additional breeds (Table 1). Individuals or markers present- 
ing 5% or more missing data were excluded from the study. 
Further details on the AFLP protocol and repeatability of 
the genotypes obtained can be found in Additional file 1. 

Genetic distances and distance-based clustering 

We used AFLPsurv vl,0 [22] to calculate three differ- 
ent measures of pairwise genetic distances between 
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Table 1 Continental areas, countries and breeds of taurine and zebu cattle sampled 



Continental area 


Country 


Breed 


Zebu 


Taurine 


Code 


n^ 


nQC'' 


Southern Asia 


India 


Mariana 


X 




HAR 


4 


4 






Tharparkar 


X 




THA 


4 


4 




Pakistan 


Sahiwal 


X 




SHA 


4 


4 


Southwestern Asia 


Turkey 


Anatolian Black 




X 


ANB 


24 


23 






Turkish Gray 




X 


TGS 


24 


23 


Eastern Europe 


Hungary 


Hungarian Gray 




X 


HUG 


22 


19 




Croatia 


Istrian 




X 


ISR 


24 


23 




Poland 


Polish Red 




X 


POR 


23 


21 


Central Europe 


Belgium 


Belgian Blue 




X 


BEB 


27 


24 




France 


Blond d'Aquitaine 




X 


BLM 


20 


19 






Bretonne Pie Noir 




X 


BPN 


22 


19 






Charolais 




X 


CHA 


22 


21 






French Limousine 




X 


LIM 


25 


21 






Jersey 




X 


JER 


18 


14 






Maine-Anjou 




X 


MAI 


20 


18 






Montbeliard 




X 


MON 


22 


22 






Normande 




X 


NOR 


23 


22 






Parthenaise 




X 


PAR 


15 


15 






Salers 




X 


SAL 


20 


20 




Switzerland 


Brown Swiss 




X 


SWB 


23 


20 






Eringer 




X 


ERI 


19 


19 






Evolene 




X 


EVO 


9 


8 






Simmentaler 




X 


SIM 


21 


19 




Italy 


Bruna 




X 


BRU 


33 


29 






Frisona 




X 


FRI 


47 


44 






Grigio Alpina 




X 


GAL 


21 


19 






Italian Limousine 




X 


LMI 


22 


19 






Piedmontese* 




X 


PIM 


22(2 r) 


21 






Pezzata Rossa Italiana 




X 


PRI 


22 


22 






Rendena* 




X 


REN 


24(22*) 


24 






Valdostana Pezzata Rossa 




X 


VPR 


22 


22 




Germany 


Original German Black Pied 




X 


GBP 


20 


20 




Austria 


Pinzgauer 




X 


PIG 


24 


22 


Northern Europe 


England 


Aberdeen Angus 




X 


ABA 


20 


15 




Norway 


Blacksided Trondheim 




X 


BTR 


22 


21 






Telemark 




X 


TEL 


22 


22 






Vestland Red Polled 




X 


VPO 


22 


18 




Denmark 


Danish Red 




X 


DAR 


22 


21 






Jutland 




X 


JUT 


22 


18 




Finland 


Eastern Finn Cattle 




X 


EEC 


22 


21 






Finnish Ayrshire 




X 


FAY 


22 


20 




Iceland 


Iceland Cattle 




X 


ICE 


22 


22 




Sweden 


Swedish Red Polled 




X 


SRP 


22 


20 


Southern Europe 


Italy 


Cabannina* 




X 


CAB 


22(20*) 


20 
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Table 1 Continental areas, countries and breeds of taurine and zebu cattle sampled (Continued) 





Calvana 




X 


CAL 


40 


38 




Chianina* 




X 


CHI 


22(21*) 


20 




Cinisara 




X 


ON 


8 


7 




Marchigiana 




X 


MCG 


22 


20 




Maremmana 




X 


MAR 


45 


45 




Modicana 




X 


MOD 


12 


12 




Mucca Pisana 




X 


MUP 


40 


39 




Podolica* 




X 


POD 


22(22*) 


20 




Romagnola 




X 


ROM 


20 


19 


Western Europe Spain 


Asturiana de los Valles 




X 


RAV 


20 


19 




Betizu 




X 


BET 


20 


18 




DiLidia 




X 


DID 


20 


19 




Menorquina 




X 


MEN 


20 


19 




Rubia Gallega 




X 


RUG 


20 


20 




Sayaguesa 




X 


SAY 


20 


19 




Tudanca 




X 


TUD 


20 


18 


Western Africa Cameroon 


Banyo Gudali 


X 




CBG 


26 


18 




Cameronian Red Bororo 


X 




CRB 


25 


20 




Cameronian White Fulani 


X 




CWF 


23 


23 




Ngaoundere Gudali 


X 




CNG 


25 


18 


Guinea-Bissau 


Guinean N'Dama 




X 


GND 


20 


19 


Nigeria 


Red Bororo 


X 




NRB 


25 


24 




Sokoto Gudali 


X 




NSG 


25 


25 




White Fulani 


X 




NWF 


25 


24 


Soutli America Brazil 


Guzerat 


X 




GUZ 


32 


32 




Nellore 


X 




NEL 


32 


21 




Tabapua 


X 




TAB 


32 


32 


Total 










1,593 


1,470 


^n: Number of samples before quality control, 
"^n QC: Number of samples after quality control. 

Quality control was performed by removing samples with 5% or more missing data. 

*Breed/number of individuals used to test the repeatability of AFLP fingerprinting (see Additional files 1, 2, 3, 4, 5, 6, 7, i 
Underlined breed names correspond to the samples described previously by Negrini et ai [16]. 


I and 9). 







populations: Fst [23], Neis D [24] and Reynolds' distance 
[25]. We grouped animals according to breed or continen- 
tal area. The three Southern Asian breeds were excluded 
in the analyses for individual breeds because of their low 
sample size (n = 12). We used the base package in R 
vllS.O [26] to perform spectral decompositions on the 
matrices of pairwise genetic distances between groups in 
order to construct low-dimensional representations of the 
genetic relationships among the surveyed populations. 
The dissimilarities between pairs of groups were captured 
in n-l dimensional spaces of n observations (eigenvectors), 
where n is the number of groups, via classical multi- 
dimensional scaling (CMDS) [27]. The proportion of gen- 
etic variance explained by each eigenvector was calculated 
by dividing its respective eigenvalue by the sum of all 



eigenvalues, and expressed as percentages. Additionally, 
we applied the Neighbor-Net method to the distance 
matrices by using SPLITSTREE v4J3J [28]. 

Expected heterozygosity and ancestry informative 
markers 

With the particular interest of identifying geographical 
patterns in the extent of genetic diversity in the cattle 
breeds analyzed, we used AFLPsurv vl.O [22] to calculate 
expected heterozygosities for each continental area 
under the assumption of Hardy- Weinberg equilibrium. 
Essentially, the same values were obtained averaging per 
area over the expected heterozygosities of the separate 
breeds (data not shown). Additionally, we applied an 
ad hoc statistic to identify taurine and zebu ancestry 
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informative markers (i.e., AFLP markers with large dif- 
ferences in band presence frequencies between taurine 
and zebu breeds). For each AFLP marker, we computed 
the band presence frequency across all breeds, and then 
calculated the mean for the pool of taurine and zebu 
breeds. We then calculated the difference in band pres- 
ence frequency as Af= f taurine -fzebw Positive and nega- 
tive values indicate markers that are informative of 
taurine or zebu ancestry, respectively. We used thresh- 
olds of +0.55 and -0.55 to identify taurine and zebu an- 
cestry informative AFLP markers, respectively. Finally, 
the average of band presence frequency of informative 
markers was computed for each breed in order to assess 
the relative level of taurine/zebu introgression across the 
investigated breeds. 

Model-based clustering 

We estimated individual ancestry coefficients as parame- 
ters of a statistical model, following the Bayesian ap- 
proach implemented in STRUCTURE v2.2 [29]. This is 
referred as the admixture model adapted for AFLP 
markers with independent allele frequencies (see [29,30] 
for details). Briefly, it is assumed that the genomes of 
the sampled individuals derive from one or more of K 
ancestral populations, and the proportion of the individ- 
uals' ancestry from each one of these populations is esti- 
mated via a Markov chain Monte Carlo algorithm. The 
assumption that the alleles are independent (i.e. linkage 
equilibrium) is reasonable in the present study, as the 
AFLP panel used is sparse and the markers are unlikely 
to be closely located on the genome. We applied this 
model from K = 1 to K = 60, and ran 5 replicates of 
150,000 iterations for each analysis after a burn-in of 
100,000 iterations [31]. 

We applied two methods to identify the most likely 
number of ancestral populations underlying the ob- 
served data. The first method uses the A/C statistic de- 
scribed by Evanno et al [32], which is based on the rate 
of change in the log-likelihood of data between succes- 
sive K values. The second method was abstracted from 
the approaches for model selection reviewed by Johnson 
& Omland [33], and is based on the concept of relative 
likelihood. First, the Akaike Information Criterion {AIC) 
is calculated for each model, from K = 1 to /C = 60, as 
follows: AIC =2p -2 ln(L), where p and ln(L) are the 
number of parameters and the log-likelihood of the esti- 
mated model, respectively. Next, AIC differences are cal- 
culated for each model / as Ai = AICi-AIC^i^, where 
Alexin corresponds to the lowest AIC among all models; 
and relative likelihoods are computed as Li = e"^/^^' . 
Then, relative likelihood values are normalized across all 

K models to produce Akaike weights coi = Li/ y^^^^Ly . 

These can be interpreted as the probability that the 



respective model is the one that presents the minimum 
information loss among all competing models, and was 
used as an alternative approach to estimate the optimal 
number of K. 

Results 

Quality control 

After the exclusion of individuals exhibiting 5% or more 
missing genotypes, 1,470 animals remained from the ini- 
tial set of 1,593 (see Table 1 for details). From a total of 
135 genotyped AFLP loci, 8 were excluded due to miss- 
ing data (>5%), and the final set of AFLP markers in- 
cluded 127 loci. As most of the analyses reported 
hereafter assume marker neutrality, the impact of the in- 
clusion of putative markers under selection in all down- 
stream analyses was evaluated. In all cases, the exclusion 
of candidate outlier markers resulted in no significant 
difference in the estimates of genetic distances and an- 
cestry coefficients (Additional file 2). Therefore, all sub- 
sequent analyses were conducted using the entire set of 
127 markers. 

Genetic distance-based clustering 

Different genetic distances were highly correlated (data 
not shown) and yielded consistent results (Additional 
file 3; Additional file 4: Figure SI; Additional file 5: Figure 
S2; Additional file 6: Figure S3; Additional file 7: Figure 
S4). We present the results obtained from Reynolds' dis- 
tance (Figure 1), which was shown to be insensitive to 
variation in the number of markers [34] . 

The Nigerian zebu breeds Sokoto Gudali and White 
Fulani were the closest related populations (Reynolds' dis- 
tance = 0.005). In contrast, in spite of a possible contribu- 
tion of Spanish ancestry to Brazilian cattle, Brazilian and 
Spanish breeds are well separated with the largest distance 
between Nellore and the inbred Betizu (Reynolds' dis- 
tance = 0.656). 

The first two eigenvectors of the CMDS analysis of 
continental groups of cattle (Figure IB) explained to- 
gether 79.4% of the total genetic variance, and were cen- 
tered on Southwestern Asian taurine cattle. The first 
eigenvector corresponds to the difference between tau- 
rine and zebu cattle with Southern Asian and South 
American zebu clustered together, and an intermediate 
position of Western African zebu cattle. The second 
eigenvector adds a geographical component correlating 
with the latitude of the region of origin of cattle popula- 
tions (Figure lA-B). The Neighbor-Net clustering 
method produced results similar to those found in the 
CMDS analysis (Figure IC). 

Model-based clustering 

The log-likelihoods obtained from the admixture model 
with independent allele frequencies, assuming K = 1 to 
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Figure 1 Reynolds' distance-based clustering of cattle according to continental areas. A) Continental areas sampled. Light brown = Southwestern 
Asia, purple = Eastern Europe, yellow = Central Europe, dark blue = Northern Europe, dark red = Southern Europe, orange = Western Europe, light green = 
Western Africa, dark green = Southern Asia and South America. Arrows indicate cattle migration routes. B) Classical multi-dimensional scaling plot. Circles: 
taurine cattle; triangles: zebu cattle. Percentages inside brackets correspond to the variance explained by each respective eigenvector. C) Neighbor-Net 
clustering. Nodes represent continental areas and edges are proportional to genetic distances. 



K = 60, were compared using AK and AIC weights in 
order to identify the most Ukely number of ancestral 
populations underlying the samples. Both AK and AIC 
weights selected the model with /C = 2 as the most likely 
among all competing models (Additional file 8: Figure 
S5). Assuming the two inferred clusters approximate the 
founder B. taurus and B, indicus populations (Figure 2A), 
we found variable levels of zebu introgression across 
taurine cattle breeds from all continental areas, which 
were especially marked in Southwestern Asian taurines. 
While South American zebu breeds did not present evi- 
dent taurine introgression, this was detected in all West- 
ern African zebu breeds. 

Higher K values were not supported by both M< and 
AIC weights and were not in agreement with genetic dis- 
tances (data not shown). This indicates that models with 
K > 2 were susceptible to stochastic errors and repre- 
sented poorly the underlying ancestry components of 
our samples. This may be due to model overfitting, by 
estimation of more parameters than allowed by the ob- 
served data. Hence, for our dataset, the model-based 
clustering analysis was limited to /C = 2 due to the low 
number of dominant markers and estimation of unob- 
served genotypes. 



Ancestry informative markers and expected heterozygosities 

We identified 6 taurine and 5 zebu ancestry informative 
markers via Af, and calculated the average band presence 
frequency for these markers across all breeds (Figure 2B). 
We observed that the taurine markers had in Western 
African zebus a higher frequency of band presence than 
in South American zebus, and the opposite was also 
found for zebu markers. 

We found a geographical pattern of decrease in the 
expected heterozygosity in taurine cattle, declining from 
Southwestern Asia to Western Europe and Western 
Africa (Additional file 9: Figure S6). Despite the limited 
sample size. Southern Asian zebus were estimated to be 
more diverse than the pools of taurine breeds. The esti- 
mate obtained for the closely related South American 
zebu was slightly lower than in Southwestern Asian tau- 
rines, but still higher than in European cattle. Further- 
more, Southern Asian and Western African zebus 
exhibited the highest expected heterozygosity among all 
continental groups analyzed. 

Discussion 

The performance of AFLP technology in cattle was previ- 
ously assessed and reported to produce genotyping data 
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Western European 
Taurine 



■ Central European 
I Taurine 



Nortliern European 
Taurine 

Southern European 
Taurine 



Eastern European 
Taurine 



Southwestern Asian 
Taurine 

Western African 
Taurine 



Ancestral population (K = 2) 

^Zebu 

H Taurine 



Western African 
Zebu 



IE 



B 



South American 
Zebu 



Taurine Ancestry Informative Markers 



■ Taurine 

□ Western African Zebu 

□ Soutli American Zebu 




<mmH^z=>a:m-J<=FZCCQOFrO>;Fdi^QC5LiJa:a:H^^5:irozQZQ-ira:o^Qcnf^:>z^(3d 
mzmmfQ.cnH<<xSr5<-'^£>lfE^i^2^0OTm=)^f^<oLiJOO=50<E^Oo£<L^ 

<<mcQmmcQmoc>o""ciQLij^Lu^^ocscix ^ ^ -' -' ^ ^^^^^^z ^ °- °- 0.0.^ a: ir^ a: (r>^^ h h > > o o c>o ^ 'z.^ (D ^ ^ 

Zebu Ancestry Informative Markers 



,isiii;D__5fii5=aifisa5_sfi_S£5Diii_iii_iii_Ji: 



ii 



<mcQH^Z=>a:m-J<xZDCQOFFO>;Fdl^QCDLiJCCCCI-2^5CC(J32QZQ-ira:O^QCnfr^Z^CDd 

mzmmfQ.cnH<<x55<-Ju-£>?«E<mz3Owm3^^|<oLiJ00=50<^ 

<<m£niiimcQCQoOo"-'"QQ^ ujLL^Oc5(Tjx =^ ^^^2M^^z'^ '^cLQ. a:[r^Q:mMM„5>i-i_|_>>ooOo2zzCi2i- 

Figure 2 Admixture analysis of taurine and zebu cattle. A) Model-based clustering of cattle breeds under the admixture model with 
independent allele frequencies and 2 assumed ancestral populations (K). Each individual is represented by a vertical bar that can be partitioned 
into colored fragments with length proportional to cluster contribution. B) Bar plots of band presence frequencies for the set of taurine (above) 
and zebu (below) ancestry informative markers. Bar errors represent standard errors. See Table 1 for breed codes. 



with an error rate equal to or less than 2% across labora- 
tories [16], which is consistent with the repeatability of the 
data reported in the present study (Additional file 1). Here, 
we revisited the use of AFLPs to investigate the relation- 
ship among 13 zebu and 53 taurine cattle breeds. As AFLP 
markers are discovered as samples are genotyped, the as- 
sessment of genetic structure and differentiation reported 
in this article is free of ascertainment bias. 

As expected, the largest genetic distances were found 
between zebu and taurine breeds (Additional file 3). 
The Bayesian-clustering analysis also highlighted that 
these populations descend from distinct genetic pools 



(Figure 2). We found a decrease of the genetic diversity 
correlating with geographical distance to Southwestern 
Asia (Additional file 9: Figure S6). This observation is in 
agreement with the mitochondrial DNA (mtDNA) find- 
ings of Troy et al [35], which suggested a Southwestern 
Asian origin of European cattle with Anatolia or the Fer- 
tile Crescent as the most likely centre of taurine cattle 
domestication. Hence, the loss of diversity with increas- 
ing distance from the most plausible domestication 
centre as observed here is in line with the hypothesis that 
the ancestral taurine genetic pool was derived from the 
wild aurochs captured in Southwestern Asia. Apparently, 



Utsunomiya et al. BMC Genetics 2014, 15:47 
http://www.bionnedcentral.conn/1471-21 56/1 5/47 



Page 8 of 10 



any introgression from European or African aurochs was 
not at such a large scale that it effectively counteracted the 
loss of diversity during migration from Southwestern Asia. 

Using sequence data of 17 genes, spanning 37 kb, 
Murray et al [36] found the nucleotide and haplotype 
diversity in B, indicus to be higher than in B, taurus. In 
the present study, we also found that the expected hetero- 
zygosity in the South American zebu breeds was higher 
than in the European taurine breeds. Considering that the 
South American zebu breeds analyzed here were intro- 
duced in the American continent in the early 20^^ century 
by import of Indian animals, this finding is also consistent 
with a separate origin of B, indicus in South Asia. 

The expected heterozygosity in Southern Asian cattle 
was estimated to be higher than the closely related 
South American breeds. Although this finding is consist- 
ent with loss of diversity during sampling and import- 
ation of animals to South America, Southern Asian 
cattle were represented by few samples in our dataset, 
and the assessment of the extent of genetic diversity in 
this continental group is limited. However, these results 
support that the B, indicus species are at least as diverse 
as B, taurus cattle. 

The CMDS and Neighbor-Net analyses showed that 
zebu cattle from South America are more closely related 
to Southern Asian cattle than Western African zebu 
(Figure 1). Furthermore, except for Southern Asian zebus. 
Western African zebu breeds presented the highest ex- 
pected heterozygosity among all continental groups. Most 
likely, this was due to a relatively higher level of admixture 
[5,37,38]. 

The closer proximity of Western African zebu to tau- 
rine cattle in the CMDS plot and in the Neighbor-Net of 
Reynolds' distances also suggests that African zebus are 
more admixed with taurine cattle than South American 
zebus (Figure 1). This observation is reinforced by the 
model-based clustering and the ancestry informative 
markers analyses, where these African breeds seemed to 
carry substantial levels of taurine introgression (Figure 2). 
This may reflect that zebu cattle and taurine-zebu cross- 
breds in Africa resulted from crosses between taurine 
dams and zebu sires as shown by their taurine mtDNA 
haplotypes: import of zebu sires started in the 2^^ millen- 
nium BC and was stimulated by the Arabian invasions in 
the 7^^ century [4,39]. However, it is also plausible that 
this taurine inheritance played a role in local adaptation. 
For instance, trypanosomiasis is endemic in the Western 
Sub-Saharan region, and whereas indigenous taurines are 
tolerant, zebus may show variable susceptibility. 

Similar crossbreeding was carried out in South Amer- 
ica. When in the early 20^^ century the import of large 
numbers of zebu cattle to Brazil started, the indigenous 
herds mainly consisted of descendants from the taurine 
cattle imported since the late 15^^ century after the 



discovery of America. The model-based clustering ana- 
lysis clearly showed a genetic composition of Brazilian 
zebu close to their Indian ancestors (Figure 2A-B), indi- 
cating intensive backcrossing to zebu bulls during sev- 
eral generations. So while mtDNA is a fingerprint of the 
historical origin of the herd and is probably randomly 
segregating [40,41], the nuclear genome has been subject 
to directional selection against taurine haplotypes via 
backcrossing. Thus, artificial selection may have retained 
taurine haplotypes only if these were linked to favourable 
traits (e.g., weight, carcass, etc.). Applying whole genome 
sequence data or a high density SNP array may be useful 
to identify taurine haplotypes favoured by selection in 
these populations. 

Ancestry informative markers also detected zebu 
introgression in the taurine gene pool (Figure 2). The 
highest level of introgression was found in Southwestern 
Asia, as previously observed with microsatellites [37]. 
This event likely contributed to the highest diversity that 
is observed in this area and, therefore, should not be at- 
tributed entirely to the vicinity of Southwestern Asian 
breeds to the putative B, taurus centre of domestication. 
A low level of admixture was also detected in Southern 
and Central Italian breeds, the Sicilian Cinisara and 
Modicana in particular, confirming a previous report 
[42]. The zebu admixture appears to decrease across the 
Alps towards Central and Western Europe with few ex- 
ceptions (e.g., Aberdeen Angus). Interestingly, we con- 
firmed the low level of B. indicus introgression in 
Pinzgauer breed postulated by Caroli et al [43] on the 
basis of casein haplotype structure in Austria, but did 
not detect substantial zebu ancestry in the Piedmontese 
breed as previously suggested [44]. Given the limited 
number of ancestry informative markers (5 zebu and 6 
taurine), these results are only indicative and can be 
confounded by stochastic variation. 

Conclusions 

We used AFLP markers to set an unbiased baseline for 
multi-breed taurine and zebu cattle genetic structure 
and divergence. These markers suggested that zebu 
breeds are at least as diverse as taurine cattle, but fur- 
ther investigation is needed to determine if zebu cattle is 
more diverse than taurine cattle. We found a gradual 
loss of diversity in taurine breeds departing from the do- 
mestication centre, which is consistent with previous 
findings. Western African zebu breeds are more genetic- 
ally distant to Indian zebus than South American zebu 
cattle by substantial taurine introgression. Although the 
South American zebus also have maternal taurine intro- 
gression, most of the taurine component of the nuclear 
genome seems to have disappeared through backcrossing. 
Furthermore, the AFLP data indicated limited zebu intro- 
gression in the Italian Podolian breeds. 
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